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Abstract 

Metric embedding has become a common technique in the design of algorithms. Its apphca- 
bihty is often dependent on how high the embedding's distortion is. For example, embedding 
finite metric space into trees may require linear distortion as a function of its size. Using 
probabilistic metric embeddings, the bound on the distortion reduces to logarithmic in the size. 

We make a step in the direction of bypassing the lower bound on the distortion in terms 
of the size of the metric. We define "multi-embeddings" of metric spaces in which a point is 
mapped onto a set of points, while keeping the target metric of polynomial size and preserving 
the distortion of paths. The distortion obtained with such multi-embeddings into ultrametrics is 
at most 0(log Aloglog A) where A is the aspect ratio of the metric. In particular, for expander 
graphs, we arc able to obtain constant distortion embeddings into trees in contrast with the 
fl(logn) lower bound for all previous notions of embeddings. 

We demonstrate the algorithmic application of the new embeddings for two optimization 
problems: group Steiner tree and metrical task systems. 

1 Introduction 

Finite metric spaces and their analysis play a significant role in the design of combinatorial algo- 
rithms. Many algorithmic techniques were introduced in recent years concerning and using metric 
spaces and their approximate embedding in other spaces, see the surveys [201 1^ for an overview 
of this topic. 

Definition 1. An embedding of a metric space M = (Va//, d^/) into a metric space N = (Vat, d^r) is 
a mapping (f) : Vm Vjy. The embedding is called non- contractive if for all u,v £ Vm, dM{u,v) < 
(iAT ((/)(«), 0(f)) and has distortion at most a if in addition for all u,v G Vm, diy{(p{u), (p{v)) < 
a ■ dM{u,v). A non-contractive embedding whose distortion is at most a is called a-embedding. 

The general framework for applying metric embeddings in optimization problems is to embed 
a given metric spaces into a metric space from some "nice" family and then apply an algorithm 
for that space. As a result, the approximation ratio increases by a factor equal to the embedding's 
distortion. 

*A preliminary version of this paper appeared in |1L)I . 
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Among others, embeddings into low dimensional normed spaces |121l24j as well as probabilistic 
embeddings into trees [2101^113 have many algorithmic applications. In both cases the distortions 
of the embeddings are logarithmic in the size of the metric. Unfortunately, there is a matching 
lower bound on the distortion of these embeddings as well, which sets a limit to their applicability. 
This paper presents a partial remedy for this problem. 

Tree metrics, and in particular ultrametrics, seem a natural choice as a target class of "simple" 
metric spaces. Unfortunately, standard embedding is not useful when the target space is a tree 
metric. Embedding arbitrary metric spaces into trees requires distortion linear in the size of the 
metric space ^7\. Probabilistic embedding |2] provides a way to bypass this problem: 

Definition 2 (Probabilistic Embeddings). A metric space M = {VM,dM) is a- probabilistically 
embedded in a set of metric spaces S if there exists a distribution V over S and for every G 5, a 
non-contractive embedding : Vm — ^ Vn, such that for all u,v G Vm, ¥.j\f^x)[dj\f{(j)]\f{u), 4'n{v))] < 
a ■ dM{u, v). 

Using probabilistic embeddings, it is possible to obtain much better bounds on the distortion 
[B la m CSl H . The following bound is shown in [13 i] : 

Theorem 1. Any metric space on n points can be O(logn) probabilistically embedded in a set of 
n-point ultrametrics. Moreover, the distribution can be sampled efficiently. 

Theorem^found many algorithmic applications in approximation algorithms, online algorithms, 
and distributed algorithms, see for example El ^1 12^1 EI- The bound on the distortion in 
Theorem^is tight even for probabilistic embeddings into tree metrics for which there is an 0(log n) 
lower bound [2]. 

Theorem^ was originally formulated for a class of metric spaces defined by the following natural 
generalization of ultrametrics: 

Definition 3 ((2]). For A; > 1, a k-hierarchically well-separated tree (/c-HST) is a metric space 
defined on the leaves of a rooted tree T. To each vertex u gT there is associated a label A(u) > 
such that A{u) = if and only if is a leaf of T. The labels are such that if a vertex v is a child of a 
vertex u then A(f) < A{u)/k. The distance between two leaves x,y € T is defined as A(lca(x,y)), 
where lca(3;, y) is the least common ancestor of x and y in T. 

The definition of finite ultrametric is the same as a 1-HST. Any /c-HST is therefore, in particular, 
an ultrametric and any finite ultrametric can be fc-embedded in some /c-HST jS]. We can therefore 
restrict our attention to ultrametrics, while all results generalize to A;-HSTs. 

The main contribution of this paper is in offering a new type of metric embedding that makes 
it possible to bypass lower bounds for the standard and even probabilistic metric embeddings. 
There are two key observations that lead to this new type of embedding. The first is that in 
some applications it is natural to match a point onto a set of points in the target metric space. 
Motivated by two applications of Theorem ^ the group Steiner tree problem (henceforth, GST), 
and the metrical task systems problem (henceforth, MTS), we propose the following definition: 

Definition 4 (Multi Embedding). A multi embedding of M in is a partial surjective function 
f from A^ on M, i.e. each point x G M is embedded into a non-empty set ^~^{x). Points in f^^(x) 
are called representatives of x in A^. 
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The role of f ~^ in Definition|l]is analogous to the role of (j) in the Definitions^andElof embedding 
and probabilistic embedding. Another way to define multi embedding is by : M — > 2'^, in which 
(f){u) n (t>{v) = for every u ^ v. In our notation we have (t){x) = ^~^{x). Since the f notation will 
be more convenient, henceforth we will exclusively use it. 

The second observation is that for many applications, including those mentioned above, there 
is no need to approximate the original distance for every pair of representatives. What is really 
needed is that every path in the original space will be approximated well by some path in the target 
space. 

A path in a metric space is an arbitrary finite sequence of points in the space. The length of a 
simple path p = {ui,U2, ■ ■ ■ , Um) in a metric space M = {V, d) is defined as = Yll=i^ d{ui, Uj+i). 

Definition 5 (Path Distortion). A multi-embedding f of M in A^, is called non- contractive if for 
any u,v (z N, di\f{u,v) > dftf (f(n), f(z;)). The path-distortion of a non-contractive multi-embedding 
of M in A^, f : — i- M, is the infimum over a, for which any path p = {ui,U2, ■ ■ ■ , Um) in has 
a path p' = {u'i,U2, . . . , u'^) in A^ such that f(u^) = Ui and £{p') < a ■ i{p). 

A multi embedding whose path-distortion is at most a is called Q-path embedding. 

A crucial parameter for multi-embeddings is the size of the target space T. In general, it will 
be desirable that T will be polynomial in the size of the source space. In fact, if F = oo then there 
is a simple 1 path embedding of any finite metric space by trees: Take all finite paths, convert each 
path to a simple path (by duplicating points, if necessary), and put them under a single root with 
an edge of length half the diameter. This motivates a study of the trade-off between F and the 
path distortion of arbitrary metric spaces by tree metrics. In Section^ we study path embedding 
of expander graphs and the hypercube into tree metrics. We show e.g. that an n-point Ramanujan 
graphs have 3-path-embedding into tree metrics of size F(n) < poly(n). This is in sharp contrast 
to the status of expander graphs for previous notions of embeddings for which they are considered 
"worst case" examples with r2(logn) distortion These results directly imply nearly tight results 
on the approximation ratio for GST on expander graphs and hypercubes. 

We consider multi embeddings when the class of target metric spaces are ultrametrics. First, we 
observe that probabilistic embedding into ultrametrics directly implies a bound for path embedding 
by putting all the trees in S (the set used in the probabilistic embedding) under a common new 
root. This results with an a-path embedding into an ultrametric of size \S\n. Using the bound of 
|13j on the number of ultrametrics needed in Theorem ^ we obtain an O(logn) path embedding 
into an ultrametric of size ©(n^logn).^ 

An important parameter of the metric spaces appearing in practice is the aspect ratio of the 
metric, which is the ratio between the diameter and minimum non-zero distance in the metric 
space. It will be convenient for us to assume that the minimum distance is 1, and so the aspect 
ratio becomes the diameter. It turns out that the aspect ratio of the metric plays a significant role 
in the path distortion of multi-embeddings. In Section |21 we prove: 

Theorem 2. Fix /3 > 1. For any metric space Ad = {V,d) on \V\ = n points and aspect ratio A, 
there exists an efficiently constructible multi- embedding into an ultrametric of size , whose path 

^In a preliminary version of this paper IIUI . we also introduced the notion of probabilistic multi- embedding. Using 
that notion we were able to show probabilistic multi-embedding into ultrametrics of polynomial size and path distor- 
tion 0(log n log log log n). Since an O(logn) bound now follows from Theorem ITI 1151 HI. we have decided to drop the 
probabilistic multi-embedding result from this version. 
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distortion is at most 

Ojs (minjlog n ■ log log n , log A • log log A } ) . 

Our construction beats the probabilistic embedding based constructions on metrics with small 
aspect ratio. Expander graphs are examples where a lower bound of Q(A) exists on probabilistic 
embedding using trees |24j . 

The constructions of multi-embeddings are in a sense dual to Ramsey-type theorems for metric 
spaces jnilH]) where the goal is to find a large subset which is well approximated by some ultrametric. 

We also provide a simple example in which Theorem [21 is almost tight: Any a-path embedding 
into ultrametrics of a simple unweighted path of length n has a = ^{log n). It follows, in particular, 
that any a-path embedding into ultrametrics of the metric defined by an unweighted graph of 
diameter A has a = Jl(log A). Path embedding is motivated by two intensively-studied algorithmic 
minimization problems: GST and MTS, mentioned above. For both, the best known algorithms 
use probabilistic embedding into trees/ultametrics. In Section [2 we prove that in order to reduce 
these problems to other metric spaces it is sufficient to use path embedding. We therefore achieve 
improved algorithms for these problems whenever the path embedding distortion beats that of 
probabilistic embedding, and in particular, when the underlying metric is of small aspect ratio. 

2 Applications 

In this section we define MTS and GST show that path distortion of multi-embeddings reduces 
these problems to similar problems with different underlying metrics. 

Metrical Task Systems (MTS) was introduced as a framework for many online minimization 
problems. A MTS on metric space M = {S,d), \S\ = n, is defined as follows. A "system" has a 
set of n possible internal states S. It receives a sequence of tasks a = T1T2 • • • r^. Each task r is 
a vector r : S" — > M"*" U {00} of nonnegative costs for serving r in each of the internal states of the 
system. The system may switch states (say from u to v), paying a cost equal to the distance d{u, v) 
in M, and then pays the service cost r(f ) associated with the new state. The major limiting factor 
for the system is the requirement to process the sequence in an online fashion, i.e., serving each 
task without knowing the future tasks. 

As customary in the analysis of online algorithms, MTS is analyzed using the notion of compet- 
itive ratio. A randomized online algorithm A is called r-competitive if there exists some constant c 
such that for any task sequence a, E[costA(o")] < r ■ costopT(o') + c, where costA(o") is the random 
variable of the cost for serving a by A, and costopT(c) is the optimal (offline) cost for serving a. The 
current best online algorithm for the MTS problem in n-point metric spaces is 0(log^ nloglogn) 
competitive ^JEl (an improvement of 0131) • Both papers (51 El actually solve the MTS problem 
for ultrametrics, and then reduce arbitrary metric spaces to ultrametrics using Theorem^ We next 
show that path embedding suffices: 

Proposition 1. Assume that a metric space M is a-path embedded in N. Assume also that N has 
an r-competitive MTS algorithm. Then there is an ar- competitive algorithm for M. 

Proof. We construct an online algorithm A for M as follows: Let An be an r-competitive online 
algorithm for N, and f : — > M an a path embedding of M in N. The task sequence a is 
translated to a task sequence for N task by task as follows. A task r for M is translated into 
a task for N such that {u') = T(f(ii')). A maintains the invariant that if An is in state v', 
then A is in state f{v'). 
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It is easy to verify that cost_4((T) < cost_4^((T^), since the service costs are the same, and the 
distances in N are larger. Consider Opt((t), it defines a path p of serving a in M. Thus there 
exists a path as in the statement of Definition |S1 The path p^ is the way would be served 
in N. In this way, since f{p^) = p, the service costs in are the same as the services costs in M, 
and i{p^) < ai{p). Thus costopTjv(o"^) < « ■ costopT(f'")- Summarizing: 

E[cost^((7)] < E[cost^j^ (cr^)] < r ■ costopTjv (<^^) + c < ar ■ costopT(<7) + c. 

□ 

Corollary 2. There is an 0(log A log log A • log n log log n)- competitive randomized MTS algorithm 
for MTS defined on metric spaces with diameter A. 

Proof. Apply Theorem |21 on the original metric and obtain an O (log A log log A) path embedding 
into an ultrametric of size T{n) = poly(n). This ultrametric has 0(logr(?7,) loglogr(n)) competi- 
tive algorithm JB]- Now apply Proposition^ to obtain the claim. □ 

The Group Steiner Tree Problem (GST) [SHI can be stated as follows: Given a graph G = (V, E) 
on n vertices with a weight function c : E ^ M_|_ , and subsets of the vertices gi, ■ ■ ■ ,gk C V (called 
groups), the objective is to find a minimum weight subtree T of G that contains at least one 
vertex from each g^, i G [k]. Under certain standard complexity assumptions, this is hard to 
approximate by a factor better than maxllog^"*" log^""^ n} |19j . The current best upper bound 
on the approximation factor is 0(log^ nlog A;) jl7l I15j. In J7j, an 0(lognlogA;) approximation 
algorithm for tree metrics is given, and the general case is reduced to tree metrics using Theorem^ 
Again, we show that it is actually sufficient to use multi embedding for this problem. 

As a first step we observe that the problem can be easily cast in terms of metric spaces instead 
of graphs: Given a graph G = {V,E) with weights w : E ^ M+, let M = iV^d) be the shortest 
path metric induced by G and w onV . A tree T in M can be transformed into a tree T in G such 
that the total weight in T is not larger than the total weight in T, and T contains all the vertices 
in T. This is done by replacing each edge in T by the shortest path between its endpoints in G, 
and taking a spanning tree of the resulting subgraph. It therefore suffices to solve GST on metric 
spaces. 

Proposition 3. Assume that a metric space M is a path embedded in a metric space N . Assume 
in addition that there is a [randomized] polynomial time r approximation algorithm for any GST 
instance with k groups defined on N. Then there exists a [randomized] polynomial time 2ar- 
approximation algorithm for any GST instance with k groups defined on M . 

Proof. We construct an approximation algorithm A for the instance 
a = (M; gi, . . . , g^) as follows. Denote by f : ^ M the a path embedding of M in A^. Consider 
the following instance of GST: = {N]f~^{gi), . . ■f~^{gk))- Let An be an r-approximation al- 
gorithm for ctat. Let T/v = An{o-n) be the tree constructed by Aj\f. Denote by f{Tj\f) the image 
graph of Tn. I.e., if Tn = {Vn, En), then f{TN) = (K^n), {Hu)Hv)\ uv G En}). The graph '{{Tn) 
is a connected and its weight is at most the weight of Tn- It also spans at least one representative 
form each group. Algorithm A returns a spanning tree of f(T/v). This tree is a feasible solution 
and it satisfies costx((T) < cost^^^ (cttv)- 

Consider the tree Opt((7), double each edge in OPT(cr) and take an Euler tour p of this graph. 
There exists a path in A^, pN, as in the statement of Definition |S1 such that f(pAr) = p. The path 
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Pn is a connected graph and spans at least one representative from each group f ^{gj)- As the 
weight of p is twice the weight of Opt(cj), we have 

costopTiv ("^Af) < Kpn) < ai(j>) < 2acostopT(c)- 

Summarizing: 

E[cost^(cr)] < E[cost^j^((TAr)] < r costopTiv(<7Af) ^ 2Qr costopT(o"). 

□ 

Corollary 4. There is an polynomial time 0(log Aloglog Alognlog/c) approximation algorithm 
for GST on metric spaces with diameter A. 

3 Multi Embedding into Ultrametrics 

The following theorem is a restatement of Theorem |2l in a more general form. 

Theorem 3. Given any metric space M = {V,d) on \V\ = n points and diameter A, for any 
t G N, M is 0(tmin{log A,logn}) path embedded into an efficiently constructive ultrametric of 
size r < n^, where {3 = min{(log n)-*^/*, [t log(4A)]^/*}. 

Proof. The construction of the multi-embedding is motivated by the construction of subspaces 
approximating ultrametric in [HJ|H], but instead of deleting points, we duplicate them. We then 
prove the bounds on the path distortion. 

Let A be the diameter of M . Let x and x be two points realizing the diameter of M, and assume 
without loss of generality that \{y G M : d{x,y) < A/4}| < n/2 (otherwise, switch the roles of x 
and x). Define a series of sets Aq = {x}, and for i £ {1,2, ... , t}, Ai = {y £ M\ d{x, y) < iA/At}, 
and "shells" Si = Ai \ Ai^i. Let \V\ = n and let £i = \Ai\/n. 

The algorithm for constructing the multi-embedding works as follows: Choose a shell Si, i £ [t]. 
Recursively, construct a multi-embedding of the sub space Ai into an ultrametric Ti and a multi- 
embedding of the subspace V \ into an ultrametric T2. To construct the multi-embedding for 
M, we construct an ultrametric T with root labelled with A, and two children, one is Ti and the 
other is T2. This is a multi-embedding since the points in Si are essentially being "duplicated" at 
this stage. Note that this is a non-contractive multi-embedding. 

Next we prove an upper bound on the size of the resulting ultrametric T, assuming that the shell 
was chosen carefully enough. The bound we prove is n^, where /? = min{(log n)^/*, [4 log(4A)]^/*}. 

We begin with the first bound. Let (3 = (3{n) = (logn)^/*. The proof proceeds by induction 
on n (whereas t is fixed). There must exist i G [t] such that > e^. Indeed, note that 

n"^ < £0 ^ £t+i < 1/2. Assume for the contrary that ej_i < for all i G [t], then 

£0 < £1 < ■ ■ ■ £t ^ (■2'^ ~ n' 

which is a contradiction. Therefore we can fix i such that ej_i > . Inductively, assume that 
the recursive process results in at most (ejn)^^'^'") < (ejn)^ leaves in Ti and at most ((1 — 
ei_i)n)^«i-='-iH < ((1 - £i^i)nf leaves in T2. So |r| < (ef + (1 - ei_i)^)n^. Since £i-i > ef , 
we have ef -|- (1 — £i-i)^ < £i-i + (1 — Ej-i) = 1 and we are done. 
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We next prove the second bound. Let (3 = /3(A) = [t log(4A)]^/*. The proof is by induction on 
(the rounded value of) A. We claim that 

Indeed, assume for the contrary that no such i exists. Set a = log(2A) > 1, so that /3(A/2) = (ta)'^^^ 
and /3(A) = [t{a + 1)]^/*. Denote b = nM'^*-[*(«+i)]''" and c = (ta)^/*. The opposite of © then 
becomes e^-i < ef6, for any i G [t]. Iterating this t times we get: 



n 

So that: 



n 



(ta)2-2/*[[i(a+l)]2/*-(ia)2/*] ^ 



but this is a contradiction, since an application of the mean value theorem implies the existence of 
^ € [a, a + 1], for which 

/ \ i-2/t 

(ta)2-2/* [[t(a + 1)]^/* - (ta)2/*] = {taf-^/'[2t-^+^/'r'^^/'] = 2{ta) > ta > 1. 

Choose an index i € {1, ■ ■ ■ ,t} satisfying (©. Since i < t, A{Ai) < A/2. The choice of the 
index i, and using the inductive hypothesis, gives the required lower bound on the cardinality of T 
since: 

|rl < (e,n)^(^/2) + [(1 - e,„i)n]^(^) < ei„in^(^) + (1 - e,_i)n^(^) < n^^^l 



We note that the running time of the algorithm above is 0{v?) on each vertex in the tree and 
therefore 0(n^+^) for the whole tree . A slight variation on this algorithm (and a more careful 
analysis) has an 0{n™^^^^''^^) running time. 

The multi-embedding described above has the following properties: 

1. The multi-embedding is non-contractive. 

2. The tree structure defining the ultrametric is a binary tree.^ 

Let u be an internal vertex in the binary tree defining the ultrametric, and T the subtree rooted 
at u. We can rename the subtrees rooted with the children of u, as Ti and T2 such that: 

3. Let X and y two points in M. If / f"i(x) n T C Ti and / ^~^{y) n T C T2, then 
d{x,y)>/\(T)/At. 

4. |f(Ti)| < |f(r)|/2. 

5. A(ri) < A(r)/2. 

We next show, using the properties above, that the path distortion of this multi embedding 
is at most 8tlogmin{n, A}. Let p = {ui,U2, ■ ■ ■ ,Um) be a path in M whose length is i{p) . We 
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Ti T2 

Figure 1: A partition of the path to subpaths. 

construct a path p on the leaves of T whose length satisfies i{p) < 8t log min{n, A} • i{p) . The proof 
proceeds by induction on the height of the tree defining the ultrametric. 

We partition p into sub-paths as follows. Define a sequence of indices and a sequence of sub-trees 
of the root: Let ji = 1 and let Ti G {Ti,T2} be the subtree of the root that includes the longest 
prefix of p. Assume inductively that we have already defined and Tj_i € {Ti,T2}. Define ji to 
be the minimum index such that Uj^ is the first point in p after Uj._-^ with no representative in Tj-i. 
Let Ti € {Ti, T2} be the other subtree of the root. Assume this process is finished with js, Tg. Next 
we define another sequence of indexes: kg = m, for i < m we define ki to be the largest number, 
smaller than ji+i, such that u^. does not have a representative in Tj+i. By the construction of Ti, 
we have that ji < ki and Uk^ has a representative in Tj. See Figure Q for example of such partition. 
We have partitioned p into sub-paths {{uj., . . . , nfci))i and {{u}^., . . . , Uj-_^-^))i. Informally, a sub-path 
{uj., . . . , Uki) will be realized in Ti, while sub-path (ufc. , . . . , Uj^^^) will be realized in Ti. 

More formally, let L = i{p), Li^^i^ = ^{{ui^,Ui^+i, . . . ,Ui^)), n = \f{T)\, m = |f(ri)|, n2 = 
|f(r2)|, and A = A(r), Ai = A(ri), and A2 = A(r2). We construct by induction on the tree 
structure T a path p in T whose length satisfies L = l(p) < 8tlogmin{A,n} • L. 

By the induction hypothesis it is possible to construct for any i, a path in Tj of representatives 
of {uj- , . . . , ) whose length is 

Lj^,k, < Lj^^k, ■ 8tlogmin{n, A}. 

Next, for any i, we construct a path of representatives of (n^. , . . . , Uj^^-^^). Note that Ufc.+i, . . . , Uj-^-^^i 
have representatives in both Tj and Tj+i. Therefore, we construct inductively a path from a rep- 
resentative of Uki+i to a representative of Uj-^-^^i in Ti, so L^. j-^^ < L^. j-^^8tlogmin{77,i, Ai}. 
We then connect the representative of u^^ with the representative of Uk^+i and the representative 
of itj._^^_i with the representative of tixi+i, each such edge is of length at most the diameter of 
T, A. We have therefore constructed a path of representatives of (iifc. , . . . ,Uj^^-^) whose length is 
Lfc. o-^, + 2A. 

Since itfc. does not have a representative in Tj+i and Uj.^^ does not have representative in Ti, 
we conclude using property © above, that dM{uki,Uji+i) > A/4t, and so A < 4t • Lfc.j-^-^. To 

^Note that more generally, any ultrametric can be defined by a binary tree. 
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summarize, 



Lk„ji+i < Lfc^,j,+i8tlogmm{77,i, Ai} + 2A 
< Lfc^jv+,8t(logmin{n/2, A/2} + l) 
= ^fc„ii+i8tlogmm{n, A}. 

We conclude, 

s s—1 
i=l i=l 

s s—1 

< 8tlogmin{n, A} • + Y,Lk,,M+i) 

i=l 1=1 

= 8t log min{n, A}L. 

□ 

We end the discussion on multi embedding into ultrametrics with the following impossibility 
result. 

Proposition 5. Consider the metric defined by a simple N -point path. Then any a path- embedding 
of this metric in an ultrametric must have a = r2(logn). 

Proof. Let M = {vi,V2, . . . ,Vn} be the metric space on n points such that dM{vi,Vj) = \i — 
j\. We prove that for any non-contractive multi-embedding into an ultrametric T, any path of 
representatives of (f i, f2, • • • , Vn) is of length at least g{n) = | log n. 

The proof proceeds by induction on n. For n = 1 the claim is trivial. For n > 1, let 
{v[,V2, . . . ,v'^) be a path of representatives in T. Let u = lcaT{v[,v'^), A(u) = dT{v[,v'n) > 
duivij^n) = n — 1. Let Ti be the subtree of the child of u that contains v[. Ti does not contains 
v'^. Let ii < n be the maximal i such that {v'l, . . . ,f -^l C Ti. As ii + 1 is not contained in Ti, it 
must be that dT{v[^,v[^_y,]) > A(n) > n — 1. By the induction hypothesis 

£t(K,...,<)) ><7(n), ^t(«+i,...,<)) >9(n-ii). 

Since (7 is a convex function, {g{ii) + g{n — ii))/2 > g{{ii + (n — zi))/2) = g{n/2). We conclude 

£t(K, . . . , v'^)) > g{ii) + (n - 1) + g{n - n) 

> 2g{n/2) + n - 1 = 2^ log f + (n - 1) > § logn. 

□ 

4 Multi-Embedding into Trees 

In this section we consider multi embeddings into arbitrary tree metrics. We only have preliminary 
results. Specifically, we only consider two important types of metric spaces: expander graphs and 
the discrete cube with the Hamming metric, for which we obtain better results. For both of them 
the preceding sections proved an upper bound of O(loglognlogloglogn) and a lower bound of 
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O(loglogn) on the path distortion of multi-embeddings into ultrametrics (the lower bound follows 
since both metrics contain a path of length O(logn)). 

We begin with the observation that for F = oo it is easy to obtain 1 path embedding of any 
finite metric space into trees. This is achieved by defining an infinite tree metric as follows: joining 
all possible finite paths with a common root, where the first node in the path is connected with an 
edge weight of A/2 to the root. Moreover, 

Proposition 6. Given a metric space M defined by an unweighted graph of maximum degree d and 
diameter A, and let s S N. Then M can be (2 + ^) path-embedded into a tree metric of size nd^ . 

Proof. Along the lines of the construction described above, we take all paths of length s, and join 
these with a common root, where the first node in the path is connected with an edge weight of 
A/2 to the root. Obviously, there are at most nd^ such paths. Notice that our choice of weights to 
the edges adjacent to the root guarantees that distances in the resulting tree are no smaller than 
the original distances. We next claim that the path distortion is at most (2 + y)- To see this, 
consider a path p = {vi, . . .v^) of length L We partition p into sub-paths of length s: p = piP2 • • - pt, 
where t = \i/s] , pj = {v(^j_i)s+i, ■■■ , Vjs) for j < t, and pt = {v(^t-i)s+i, ■ ■■,ve)- Now the sub-path 
Pj is mapped to the appropriate path in the tree. Note that the length of the image path is 
2^ + (i-l)A. □ 

This simple fact is particularly interesting for its implication for expander graphs. Let G be an 
{n,d,jd) graph, i.e., a d-regular, n- vertex graph whose second eigenvalue in absolute value is at 
most 'jd. It is known that such a graph has diameter at most 1 -|- log^/^ n, and so we obtain: 

Corollary 7. Any (n,d,^d) -graph has 3 path embedding in a tree of size dn^^^°^'^^'>'^ . 

We also note that for the trees constructed in the proof of Proposition |HJ it is particularly easy 
to obtain a better approximation algorithm for GST. 

Lemma 8. Gonsider a tree metric M = {V, d), where F = Pi U P2 U • • • U P^, Pi = {v\,v\, . . . , v\) 
is an unweighted simple path of length s, and d{v\,v\) = A for i j- Then an instance of the GST 
with k groups defined on M has (1 -|- ^)(1 + Ink) approximation algorithm. 

Proof. Consider a GST instance gi, ■ ■ ■ gk defined on M. We first check whether there is a solution 
that is completely contained in one Pi. This can be checked in polynomial time by noting that if 
an optimal solution is contained in one Pi then it is an interval. Thus all is needed to be checked 
are ^(*^^) intervals. 

Otherwise, the optimal solution intersects t > 1 of the paths Pi,. . .Pe. Define a Hitting Set 
instance whose ground set is {Pi, . . . , Pi} and the subsets are g[, . . . , g'j^, where g'^ = {Pj; Pj H gi ^ 
0}. It follows that the optimal cost of the hitting set problem is at least t — 1. The Hitting Set 
problem has a polynomial time In A; approximation algorithm |22| l25] . Let S be the approximate 
solution for the hitting set. We define a solution for the GST instance by taking a natural path 
over U{Pi; Pi € S}. Note that its length is at most (A + 2s)\S\ < {A + 2s){t - 1)(1 + Ink). But 
the cost of the optimal GST algorithm is at least (t — 1)A. □ 

Corollary 9. For fixed d > X, There exist constants c = Cd,\, C = Cd,\, and polynomial p{t) = 
Pd,x{t) such that GST on {n,d,X) graphs has p{n) -time (Clog A;) approximation algorithm, and it 
is NP-hard to approximate within a factor of clog k. 
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Proof. The approximation algorithm fohows from Proposition 1^1 and Lemma |H1 by setting s = A. 
The hardness result follows since an (n, d, A) graph contains a subset of n^'i,\W points that is 
Od,x{l) approximated by an equilateral space [S]. GST on equilateral space is equivalent to a 
standard Hitting Set problem, which is NP-hard to approximate within a factor of c In A; [28| I26j . 
Usage of points not in this subspace ("Steiner points") can improve the approximation factor by 
at most a factor of two |29[ I18j . □ 

We next examine multi-embedding of the /i-dimensional hypercube with n = 2^ vertices. Using 
Proposition ini with s = /i/ log h, and using d = h and A = h, we obtain (log logn + 2) path embed- 
ding into trees of size T{n) < v? . Using Lemma|Hlit also implies a polynomial time 0(log k log log n) 
approximation algorithm to GST on the cube. Similarly to the expander graphs, it is hard to ap- 
proximate instances of GST on the cube to within a clog/c factor, for some constant c > 0, since 
the cube contains a subset of n^^^^ points that is 0(1) approximated by an equilateral space. 

5 Discussion 

An interesting open problem is to determine worst case bounds for path distortion of multi embed- 
ding into trees of polynomial size. As indicated by the case of expander graphs, such bounds may 
be better than those for ultrametrics. 

Results on multi-embedding into trees directly reflect on the approximability of GST. As shown 
for expanders and hypercubes, it is possible that for special classes of metric spaces, a combination of 
improved path embedding and a specialized solution would yield (nearly) tight upper bounds. Our 
approach to show the (near) tightness of the results in those cases stems from metric Ramsey-type 
considerations (i.e. the existence of large approximately equilateral subspace). Such considerations 
are in fact more general and may lead to more results of this flavor. ^ 

Multi-embedding into ultrametrics, also implies multi embedding into where d = O(logn) 
with similar path distortion jn|. It is natural to ask whether better path distortion is possible 
for multi-embedding into ii or £2- Further study of multi-embeddings in other settings and their 
applications seems an attractive direction for future research. 
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